Go top
Paper information

Understanding the influence of data characteristics on the performance of point-of-interest recommendation algorithms

L.W. Dietz, P. Sánchez, A. Bellogín

Information Technology & Tourism

Summary:

Point-of-interest (POI) recommendations are essential for travelers and the e-tourism business. They assist in decision-making regarding what venues to visit and where to dine and stay. While it is known that traditional recommendation algorithms’ performance depends on data characteristics like sparsity, popularity bias, and preference distributions, the impact of these data characteristics has not been systematically studied in the POI recommendation domain. To fill this gap, we extend a previously proposed explanatory framework by introducing new explanatory variables specifically relevant to POI recommendation. At its core, the framework relies on having subsamples with different data characteristics to compute a regression model, which reveals the dependencies between data characteristics and performance metrics of recommendation models. To obtain these subsamples, we subdivide a POI recommendation data set on New York City and measure the effect of these characteristics on different classical POI recommendation algorithms in terms of accuracy, novelty, and item exposure. Our findings confirm the crucial role of key data features like density, popularity bias, and the distribution of check-ins in POI recommendation. Additionally, we identify the significance of novel factors, such as user mobility and the duration of user activity. In summary, our work presents a generic method to quantify the influence of data characteristics on recommendation performance. The results not only show why certain POI recommendation algorithms excel in specific recommendation problems derived from a LBSN check-in data set in New York City, but also offer practical insights into which data characteristics need to be addressed to achieve better recommendation performance.


Spanish layman's summary:

Analizar la densidad, sesgo de popularidad y distribuciones de check-ins es clave para entender su impacto en la experiencia de usuarios en recomendaciones de POIs. Proponemos un framework con subsamples y regresión que muestra cómo estas características afectan a la precisión, novedad y exposición.


English layman's summary:

Analyzing sparsity, popularity bias, and check-in distributions is key to understanding their impact on user experience in POI recommendations.We propose a framework with subsamples and regression that reveals how these features shape accuracy, novelty, and exposure in recommendation algorithms.


Keywords: Point-of-interest recommendation · Ofine evaluation · Regression analysis · Data characteristics


JCR Impact Factor and WoS quartile: 6,300 - Q1 (2023)

DOI reference: DOI icon https://doi.org/10.1007/s40558-024-00304-0

In press: January 2025.



Citation:
L.W. Dietz, P. Sánchez, A. Bellogín, Understanding the influence of data characteristics on the performance of point-of-interest recommendation algorithms. Information Technology & Tourism.


    Research topics:
  • Data analytics